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A genome-wide association study identified single nucleotide polymorphisms (SNPs) rs3077 and rs9277535 located in the 
3 untranslated regions of human leukocyte antigen (HLA) class II genes H LA-DP A1 and HLA-DPB1, respectively, as the 
independent variants most strongly associated with chronic hepatitis B. We examined whether these SNPs are associated with 
mRNA expression of HLA-DPA1 and HLA-DPB1. We identified gene expression-associated SNPs (eSNPs) in normal liver 
samples obtained from 651 individuals of European ancestry by integrating genotype (~ 650 000 SNPs) and gene expression 
(>39000 transcripts) data from each sample. We used the Kruskal-Wallis test to determine associations between gene 
expression and genotype. To confirm findings, we measured allelic expression imbalance (AEI) of complementary DNA 
compared with DNA in liver specimens from subjects who were heterozygous for rs3077 and rs9277535. On a genome-wide 
basis, rs3077 was the SNP most strongly associated with HLA-DPA1 expression (p= 1Ct 48 ), and rs9277535 was strongly 
associated with HLA-DPB1 expression (p= My 15 ). Consistent with these gene expression associations, we observed AEI for 
both rs3077 (p= 3.0 x 1Ch 7 ; 17 samples) and rs9277535 (p= 0.001; 17 samples). We conclude that the variants previously 
associated with chronic hepatitis B are also strongly associated with mRNA expression of HLA-DPA1 and HLA-DPB1, 
suggesting that expression of these genes is important in control of HBV. 
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Introduction 

Chronic infection with hepatitis B virus (HBV) is the 
most common cause of liver cancer worldwide, as well as 
a major risk factor for development of cirrhosis and end- 
stage liver disease. 1 The hepatitis B vaccine is highly 
effective in preventing new infections, but ~ 360 million 
people still suffer from chronic hepatitis B and there are 
~ 600 000 annual deaths from HBV-related causes. 1 
Recent genome-wide association studies identified single 
nucleotide polymorphisms (SNPs) located within the 
human leukocyte antigen (HLA) class II genes HLA- 
DPA1 and HLA-DPB1 to be associated with chronic 
hepatitis B. 2 - 3 Replication studies performed on two of 
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these SNPs (rs3077 and rs9277535) confirmed and 
strengthened results from the genome-wide association 
studies. 2 Class II HLA genes encode proteins expressed 
on the surface of antigen-presenting cells such as 
macrophages, dendritic cells and B cells, and thereby 
have a critical role in presentation of antigens to CD4 + 
T-helper lymphocytes. HLA genes have many structural 
variants that have been linked to immune response to 
infectious agents, 4 but genetic variants that influence 
HLA mRNA expression might also affect antigen 
presentation and many 'gene expression-associated 
SNPs' (eSNPs) have been found for HLA genes. 5 - 6 An 
integrated approach combining genotype information 
with genome-wide gene expression data in relevant 
tissues can identify genetic variations that are both 
regulatory and disease causing. 6 - 7 In this study, we 
examined whether SNPs implicated in chronic hepatitis 
B are associated with mRNA expression in liver samples 
from the Human Liver Cohort, a large study of genotype 
and gene expression in normal liver. 6 Additionally, 
we performed confirmatory allelic expression imbalance 
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(AEI) studies in human liver and peripheral blood. 
We found that SNPs associated with chronic hepatitis B 
are strongly associated with decreased expression of 
important antigen-presenting molecules, HLA-DPA1 and 
HLA-DPB1. 



Results 

Genetic variants and mRNA expression 
A published genome-wide association study for chronic 
hepatitis B identified 11 disease-associated SNPs; con- 
firmatory replication studies were performed on rs3077 
and rs9277535 (ref. 2). SNP rs3077 is located in the 3' 
untranslated region (3' UTR) of HLA-DPA1, rs2395309 
lies ~6.5kb downstream of HLA-DPA1 and rs2301220 is 



found in the first intron of HLA-DPA1 . These SNPs are in 
complete linkage disequilibrium (LD) in Europeans 
(CEU HapMap samples, r 2 = 1.0; Figure la) and in near 
complete LD in Asians (JPT + CHB HapMap samples, 
r 2 = 0.93-0.97, Figure lb). Among the ~ 650 000 SNPs that 
were genotyped and examined for association 
with expression of HLA-DPA1, rs2395309, rs3077 and 
rs2301220 were the variants most strongly associated 
with mRNA expression of this gene (p~10~ 48 ; Table 1). 
Of the eight remaining SNPs that were associated with 
chronic hepatitis B, 2 five were associated with expression 
of HLA-DPA1 as well (Table 1). 

Figure 2a shows odds ratios for chronic hepatitis B 2 
and differences in mean-log gene expression by rs3077 
genotypes. Gene expression was measured relative to a 
pool of control liver samples, 6 and the risk genotype 
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Figure 1 LD (r 2 ) between SNPs associated with chronic hepatitis B: (a) European samples (CEU, HapMap); (b) Asian samples (JPT and CHB, 
HapMap). 
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Table 1 SNPs associated with chronic infection with HBV 2 and with mRNA expression of HLA-DPA1 and HLA-DPB1 in 651 human liver 
tissue samples 



SNPs 


HBV GWAS 2 
P-vdue 


HLA-DPA1 Expression in Liver 


HLA-DPB1 Expression in Liver 


Location within HLA region 


Rank 




Rank 


1 UUllA-C 




1.16E-13 


1 


1.24E-48 




NS 


T^T-'A T nnwnctroam 
1_V± / V 1 , \X\J W 1 IS 11 Cdll l 


rs3077 


1.59E-13 


1 


1.24E-48 




NS 


DPA1, 3' UTR 


rs23ul220 


1.81E-13 


5 


4.13E-47 




NS 


DPA1, intron 1 


rs9277535 


2.14E-12 


17 


1.92E-08 


2 


2.28E-15 


DPB1, 3' UTR 


rs3117222 


1.29E-10 


14 


9.76E-10 




NS 


DPB1, downstream 


rs3128917 


1.98E-10 


15 


9.76E-10 




NS 


DPB1, downstream 


rs3135021 


1.63E-08 




NS 




NS 


DPB1, intron 1 


rs9380343 


1.66E-08 




NS 




NS 


DPB2, upstream 


rs9277341 


1.84E-08 


7 


1.32E-22 




NS 


DPA1, intron 1 


rsl0484569 


2.34E-08 




NS 




NS 


DPB1, downstream 


rs2281388 


3.62E-08 


21 


1.98E-06 




NS 


DPB1, downstream 



Abbreviations: GWAS, genome-wide association study; HBV, hepatitis B virus; HLA, human leukocyte antigen; SNPs, single nucleotide 
polymorphisms . 

Rank indicates position among all cis SNPs that were tested for expression; P-values are for association of SNPs with chronic hepatitis B based 
on the GWAS 2 or with gene expression; NS indicates a non-significant P-value (>5.0 x 10 5 for czs-effect (probe to SNP distance <1 Mb)). 
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rs3077 Genotype 
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Figure 2 Association of rs3077 and rs9277535 with risk of chronic 
HBV infection 2 (blue) and mRNA expression of HLA-DPA1 and 
HLA-DPB1 in 651 human liver tissue samples from the Human 
Liver Cohort (red). 6 (a) Odds ratios for chronic HBV 2 and difference 
in expression of HLA-DPA1 in liver tissue, by rs3077 genotype, (b) 
Odds ratios for chronic HBV 2 and difference in expression of HLA- 
DPB1 in liver tissue, by rs9277535 genotype. Gene expression values 
represent the arithmetic increase in mean-log gene expression 
compared with the risk genotype (rs3077GG or rs9277535GG) 
groups. 



for chronic HBV (GG) is set to zero as a reference. 
The rs3077-G allele was associated with both higher risk 
of chronic hepatitis B and lower expression of HLA-DPA1 
in a pattern consistent with an additive genetic model. 



Similarly, SNP rs9277535, located within the 3' UTR of 
HLA-DPB1 (~22kb from rs3077), was strongly asso- 
ciated with both increased risk of chronic hepatitis B and 
decreased expression of HLA-DPB1 (ft = 10~ 15 ; Table 1, 
Figure 2b). There is only weak LD between rs9277535 
and rs3077 in Europeans (r 2 = 0.09, D' = 0.50; HapMap 
CEU; Figure la) and Asians (r 2 = 0.24, D' = 0.54; HapMap 
JPT + CHB; Figure lb), suggesting that effects of these 
SNPs are likely to be independent. 

To confirm that rs3077 and rs9277535 are associated 
with increased mRNA expression of their respective 
genes, we also examined genotype and gene expression 
data from a publically available database of 400 Epstein- 
Barr virus-transformed lymphoblastoid cell lines ob- 
tained from British children with asthma. 5 Among these 
subjects, rs3077 was strongly associated with HLA-DPA1 
expression (p = 10" 21 ) and rs9277535 with HLA-DPB1 
expression (p = 10~ 12 ). 

Allelic expression imbalance 

In the Human Liver Cohort, mRNA expression was 
measured by a microarray. It has been suggested that 
genetic and structural variations within the HLA region 
could affect efficiency of hybridization of microarray 
probes, 8 - 9 and, if so, differences in mRNA levels could 
reflect differences in detection efficiency rather than true 
differences in gene expression. We reasoned that if a 
genetic variation is truly associated with altered gene 
expression in cis, samples from heterozygous individuals 
should display AEI in complementary DNA (cDNA) 
when compared with DNA. 10 The location of rs3077 and 
rs9277535 in transcribed (3' UTR) regions of HLA-DPA1 
and HLA-DPB1 makes these SNPs good candidates for 
AEI testing. Furthermore, AEI Taqman assays for these 
SNPs were designed to avoid any other underlying 
genetic variation in the amplicons. Therefore, AEI is an 
independent test for possible differences in gene expres- 
sion associated with genetic variants. Consistent with the 
mRNA expression associations found in the Human 
Liver Cohort, we observed AEI for both rs3077 (liver 
tissue (n = 17), p = 3.0 x 10~ 7 , Figure 3a) and monocytes 
((ft = 22), p = 2.0 x 10" 8 , Figure 3b) and rs9277535 (liver 
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Figure 3 AEI in heterozygous DNA and cDNA in human samples: (a) liver (ft - 
(n = 22) for rs3077; (d) monocytes (ft = 19) for rs9277535. 
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tissue (n = 17), p = 0.001, Figure 3c) and monocytes 
((ft = 17), p = 0.04, Figure 3d). For both SNPs, the pro- 
portion of non-risk allele A was significantly increased in 
cDNA compared with DNA of heterozygous samples, 
indicating reduced expression of the risk alleles. AEI for 
rs3077 was not explained by rs9277535 and vice versa 
(Supplementary Figure 1), although the statistical power 
for those comparisons was more limited than for the 
primary comparisons. Therefore, it is likely that rs3077 
and rs9277535 (or variants in high LD with these 
SNPs) exhibit independent effects on the expression of 
HLA-DPA1 and HLA-DPB1, respectively. 



Discussion 

On the basis of findings for both genome-wide mRNA 
expression and allelic expression imbalance, it seems that 
SNPs rs3077 and rs9277535 are strongly associated with 
regulation of HLA-DPA1 and HLA-DPB1, respectively. 
Previous studies found that these SNPs were highly 
associated with chronic HBV infection. 2 - 3 Together, these 
independent studies strongly implicate lower expression 
of HLA-DPA1 and HLA-DPB1 with increased risk of 
chronic HBV. 

HLA-DPA1 and HLA-DPB1 are expressed on the 
surface of antigen-presenting cells. In the liver, expres- 
sion of these proteins is likely to be limited to a small 
population of Kupffer cells, the resident macrophages of 
the liver. Kupffer cells are derived from blood mono- 
cytes, and we observed evidence for AEI in both 
monocytes and liver samples. We found that mRNA 
expression of both HLA-DPA1 and HLA-DPB1 was low in 
total human liver tissue, suggesting that it might be 
limited to a specific sub-population of cells. We attemp- 
ted to quantify protein expression in lysates from total 
liver tissue, but failed to detect a measurable signal (data 
not shown). Similarly, HLA-DPA1 and HLA-DPB1 



protein expression was undetectable in liver tissue 
examined by antibody-based proteomic methods. 11 It is 
intriguing that the genetic risk variants associated with 
chronic hepatitis B affect expression of the alpha (DPA) 
and beta (DPB) chains of the same antigen-presenting 
complex, as this suggests that insufficient expression of 
either or both of these chains might result in the same 
phenotype. 

Kamatani et al 2 reported that chronic hepatitis B was 
associated with haplotypes comprised of rs3077-G, 
rs9277535-G and certain structural variants of HLA-DPA1 
and HLA-DPB1. Our findings do not exclude a role for 
structural variants of HLA-DPA1 and HLA-DPB1 in 
chronic hepatitis B, as viral control of HBV could be 
affected by both regulatory and structural variants. For 
example, functional variants of MBL2, which has an 
important role in host defense against HIV-1 and other 
infectious agents, include both regulatory and structural 
variants. 12 On a genome-wide basis, rs3077 is the variant 
most strongly associated with risk of chronic hepatitis B 
and the variant most strongly associated with expression 
of HLA-DPA1. It seems highly likely, therefore, that 
expression of HLA-DPA1 has a role in the clearance of 
HBV. Similarly, the association of rs9277535 with chronic 
HBV and with HLA-DPB1 expression suggests that HLA- 
DPB1 expression also contributes to chronic HBV infection. 

Kamatani et al 2 noted that the HBV risk alleles for 
rs3077 and rs9277535 are more common in Asians than 
Europeans (fitting the global pattern for the epidemio- 
logy of chronic hepatitis B). LD patterns in this genomic 
region are very similar for populations of Asian or 
European ancestry groups (Figures la and b), indicating 
that the relationships between SNPs in this region are 
comparable between these two ancestral groups even 
though the allele frequencies differ. 

We could not determine why the rs3077 and rs9277535 
variants are associated with decreased mRNA expression 
of HLA-DPA1 and HLA-DPB1 in liver, but variants within 
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the 3' UTR may affect mRNA stability through binding of 
regulatory factors or regulation by microRNAs. 13 SNP 
rs3077 was found to be associated with methylation level 
of HLA-DPB1 and HLA-DPA1 in adult cerebellum 
samples studied by Zhang et al. 14 (see Supplementary 
Table 5 of that paper). It may also be noteworthy that 
rs9277535 was linked to two different copy-number 
variation regions in European HapMap subjects. 15 
We did not observe any obvious factors that could 
explain differential expression of HLA-DPA1 and 
HLA-DPB1 mRNA, but future studies should address 
this question. 

SNPs rs3077and rs9277535 have been associated with 
other diseases. The rs9277535 variant has been linked to 
primary biliary cirrhosis. 16 Both rs3077 and rs9277535 have 
been associated with rheumatoid arthritis, 17 although other 
SNPs in the major histocompatibility complex class II 
region were more strongly associated with that disease. 

In conclusion, genetic variants previously associated 
with chronic hepatitis B 2 are also associated with 
decreased expression of an antigen-presenting complex 
that consists of HLA-DPA1 and HLA-DPB1 chains. 
Together, these independent studies strongly implicate 
lower expression of HLA-DPA1 and HLA-DPB1 as a 
factor for increased risk of chronic hepatitis B. Other 
recent studies have demonstrated a relationship between 
expression of HLA-C and control of HIV. 1819 It is possible, 
therefore, that HLA expression has a role in control of a 
range of viruses. More broadly, our findings lend 
support to the concept that SNPs identified through an 
integrated genomic approach can provide both confirma- 
tion and functional insights for disease associations. 6 - 20 
If greater expression of HLA-DPA1 and HLA-DPB1 
facilitate clearance of HBV infection, development of 
therapies that increase expression of these genes might 
aid in treatment of chronic hepatitis B. 



System (LTCDS). Anonymized fresh peripheral blood 
samples from healthy donors were provided by the 
Blood Bank at the NIH. Monocytes were purified from 
fresh blood with CD14 + -coated magnetic MicroBeads 
with AutoMacs (Miltenyi Biotec, Auburn, CA, USA) as 
previously described. 21 DNA from all samples was 
prepared with DNAeasy kit (Qiagen, Valencia, CA, 
USA) and RNA was prepared with Trizol reagent 
(Invitrogen, Carlsbad, CA, USA) followed by RNAeasy 
kit (Qiagen). The AEI was quantified with the allelic 

discrimination genotyping assays C 11916951_10 for 

rs3077 and a custom-designed assay for rs9277535. To 
ensure that the signal was specific for cDNA and was not 
derived from residual DNA in these samples, a 'no 
reverse-transcriptase' control reaction was tested initi- 
ally. Genotyping for all DNA and cDNA samples was 
performed in duplicate on the 7900 Sequence Detection 
System (ABI, Foster City, CA, USA). 

AEI experiments were carried out as previously 
described. 22 - 23 A standard curve was prepared based on 
10 dilutions of two homozygous DNA samples prepared 
from liver tissue, representing 5, 15, 25, 35, 45, 55, 65, 75, 
85 and 95% of allele 'X', based on the standard curve. The 
potential AEI in samples heterozygous for rs3077 and 
rs9277535 was evaluated by comparing the proportion of 
allele 'X' in DNA with that in cDNA with a paired two- 
sided T-test. To evaluate the potential effect of one SNP 
on AEI for the other (for example, effect of rs3077 on AEI 
for rs9277535), we tested whether the proportion of allele 
'X' in samples heterozygous for rs9277535 differed by the 
genotype of rs3077. Using an F-test, we compared the 
proportion of allele 'A' in all samples that were 
heterozygous for rs9277535 with the proportion of allele 
'A' in samples that were heterozygous for rs9277535, but 
homozygous for rs3077. 



Methods 

Genome-wide association of SNPs with mRNA expression 
As previously described, 6 the Human Liver Cohort 
identifies eSNPs by integrating genotype and gene expres- 
sion data for liver samples without evidence of liver 
disease. The current analysis is based on 651 liver samples 
obtained from patients of non-Hispanic European ancestry. 
Genome-wide gene expression was measured with cus- 
tom-designed Agilent (Agilent Technologies, Inc., Santa 
Clara, CA, USA) arrays (>39000 transcripts), and geno- 
typing was performed with the Alumina (San Diego, CA, 
USA) Sentrix HumanHap650Y genotyping chips. 6 The 
Kruskal-Wallis test was used to determine association 
between adjusted expression traits according to an additive 
genetic model. False discovery rate threshold was set 
at 10%. Associations were considered significant at 
p<5.0xl0~ 5 for ds-factors (probe to SNP distance 
<lMb) and p<1.0 x 10" 8 for trans -factors. 6 Only SNPs 
used both in genome-wide association studies for chronic 
HBV 2 and the mRNA expression study 6 were used in this 
analysis. For each of the SNPs, gene expression values were 
compared with the referent non-risk genotypes. 

Allelic expression imbalance 

Liver tissue samples for AEI studies were non-cancerous 
specimens provided by the Liver Tissue Cell Distribution 
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